A General Framework for Content-enhanced Network Representation Learning

نویسندگان

  • Xiaofei Sun
  • Jiang Guo
  • Xiao Ding
  • Ting Liu
چکیده

This paper investigates the problem of network embedding, which aims at learning low-dimensional vector representation of nodes in networks. Most existing network embedding methods rely solely on the network structure, i.e., the linkage relationships between nodes, but ignore the rich content information associated with it, which is common in real world networks and beneficial to describing the characteristics of a node. In this paper, we propose content-enhanced network embedding (CENE), which is capable of jointly leveraging the network structure and the content information. Our approach integrates text modeling and structure modeling in a general framework by treating the content information as a special kind of node. Experiments on several real world networks with application to node classification show that our models outperform all existing network embedding methods, demonstrating the merits of content information and joint learning. Introduction Network embedding, which aims at learning lowdimensional vector representations of a network, has attracted increasing interest in recent years. It has been shown highly effective in many important tasks in network analysis involving predictions over nodes and edges, such as node classification (Tsoumakas and Katakis 2006; Sen et al. 2008), recommendation (Tu, Liu, and Sun 2014; Yu et al. 2014) and link prediction (Liben-Nowell and Kleinberg 2007). Various approaches have been proposed toward this goal, typically including Deepwalk (Perozzi, Al-Rfou, and Skiena 2014), LINE (Tang et al. 2015), GraRep (Cao, Lu, and Xu 2015), and node2vec (Grover and Leskovec 2016). These models have been proven effective in several real world networks. Most of the previous approaches utilize information only from the network structure, i.e., the linkage relationships between nodes, while paying scant attention to the content of each node, which is common in real-world networks. In a typical social network with users as vertices, the user-generated contents (e.g., texts, images) will serve as rich extra information which should be important for node representation and beneficial to downstream applications. Figure 1 shows an example network from Quora, a community question answering website. Users in Quora can follow each other, creating directed connections in the network. How does the shape

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Image Classification via Sparse Representation and Subspace Alignment

Image representation is a crucial problem in image processing where there exist many low-level representations of image, i.e., SIFT, HOG and so on. But there is a missing link across low-level and high-level semantic representations. In fact, traditional machine learning approaches, e.g., non-negative matrix factorization, sparse representation and principle component analysis are employed to d...

متن کامل

Detecting Overlapping Communities in Social Networks using Deep Learning

In network analysis, a community is typically considered of as a group of nodes with a great density of edges among themselves and a low density of edges relative to other network parts. Detecting a community structure is important in any network analysis task, especially for revealing patterns between specified nodes. There is a variety of approaches presented in the literature for overlapping...

متن کامل

Weakly-Supervised Deep Learning for Customer Review Sentiment Classification

Sentiment analysis is one of the key challenges for mining online user generated content. In this work, we focus on customer reviews which are an important form of opinionated content. The goal is to identify each sentence’s semantic orientation (e.g. positive or negative) of a review. Traditional sentiment classification methods often involve substantial human efforts, e.g. lexicon constructio...

متن کامل

A Framework for Longitudinal Influence Measurement between Communication Content and Social Networks

Artificial intelligence has a long history of learning from domain problems ranging from chess to jeopardy. In this work, we look at a problem stemming from social science, namely, how do social relationships influence communication content and vice versa. The tools used to study communication content (content analysis) have rarely been combined with those used to study social relationships (so...

متن کامل

Neural Network Meta-Modeling of Steam Assisted Gravity Drainage Oil Recovery Processes

Production of highly viscous tar sand bitumen using Steam Assisted Gravity Drainage (SAGD) with a pair of horizontal wells has advantages over conventional steam flooding. This paper explores the use of Artificial Neural Networks (ANNs) as an alternative to the traditional SAGD simulation approach. Feed forward, multi-layered neural network meta-models are trained through the Back-...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1610.02906  شماره 

صفحات  -

تاریخ انتشار 2016